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ABSTRACT 

We present near-infrared emission line counts and luminosity functions from the HSTWFC3 Infrared 
Spectroscopic Parallels (WISP) program for 29 fields observed using both the G102 and G141 grism. 
Altogether we identify 1048 emission line galaxies with observed equivalent widths greater than 40 
A, 467 of which have multiple detected emission lines. We use simulations to correct for significant 
(>20%) incompleteness introduced in part by the non-dithered, non-rotated nature of the grism 
parallels. The WISP survey is sensitive to fainter flux levels (3-5 x 10 -17 ergs s" 1 cm" 2 ) than the future 
space near-infrared grism missions aimed at baryonic acoustic oscillation cosmology (1-4 xlO" 16 ergs 
s -1 cm -2 ), allowing us to probe the fainter emission line galaxies that the shallower future surveys 
may miss. Cumulative number counts of 0.7 <z< 1 .5 galaxies are in rough agreement with the corrected 
count predictions of Ha emitters from Geach et al. (2010), reaching 10,000 deg" 2 above an Ha flux 
of 2xl0" 16 ergs s" 1 cm" 2 . Ha-emitting galaxies with comparable [Oin] flux are roughly 5 times less 
common than galaxies with just Ha emission at those flux levels. Galaxies with low Ha/[Oin] ratios 
are very rare at the brighter fluxes that future near-infrared grism surveys will probe; our survey finds 
no galaxies with Ha/[Oin] < 0.95 that have Ha flux greater than 3xl0 -16 ergs s" 1 cm" 2 . Our Ha 
luminosity function contains a slightly higher number density of faint line emitters than the NICMOS 
near-infrared grism surveys, likely a result of the smaller equivalent widths to which the WISP survey 
is sensitive. We also find that our high redshift (z— 0.9-1.5) counts are slightly higher (factors of 1.3-2) 
than the high redshift (z=1.47) narrow band Ha survey of HiZELS (Sobral et al. 2012), although our 
lower redshift luminosity function (z=0.3-0.9) is consistent with their z=0.84 result. The evolution in 
the [Oni] luminosity function from z=0.7-2.3 is almost entirely in the L* parameter, which steadily 
increases with redshift over that range. 

Subject headings: galaxies: evolution, galaxies: high-redshift, infrared: galaxies 



1. INTRODUCTION 

The majority of star formation, supermassive black 
hole accretion, and galaxy assembly in the universe likely 
occurred over the epoch of z=0.7-2 (e.g., McLure et al. 
2006, Perez-Gonzalez et al. 2008, van Dokkum et al. 
2010, Magnelli et al. 2011). A careful census of the spa- 
tial distribution of galaxies in this redshift range is crit- 
ical for measuring the large-scale clustering of galaxies 
that results from baryonic acoustic oscillations (BAO), 
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which will enable us to probe the expansion history of 
the universe and address the equation state of dark en- 
ergy (Eisenstein & Hu 1998, Cole et al. 2005, Eisenstein 
et al. 2005, Weinberg et al. 2012). Large near-infrared 
spectroscopic surveys are required to study this redshift 
regime, as the most luminous nebular emission lines move 
out of the optical at these redshifts. 

Ground-based searches for emission lines from faint 
high-redshift galaxies are severely impacted by the bright 
near-IR airglow, which effectively eliminates the possi- 
blity of slitless grism spectroscopy. Most near-infrared 
ground spectroscopy is done a single object at a time, 
requiring some form of pre-selection that generally biases 
against discovery of the highest equivalent width sources. 
In the near future, multi-object near-infrared spectro- 
graphs on the world's largest telescopes (i.e. MOSFIRE, 
FLAMINGOS-2, MOIRCS, EMIR, KMOS, etc.) will al- 
low for the study of a much greater number of galaxies, 
but even then they will be subject to the atmospheric 
transmission windows and an inability to discover the 
emission line objects with the most extreme equivalent 
widths. 

An alternative method to spectroscopy for identify- 
ing large numbers of z>0.7 emission line galaxies are 
wide-field narrow band searches targeting the transmis- 
sion windows in our atmosphere, e.g. HiZELS (Geach 
et al. 2008, Sobral et al. 2009, Sobral et al. 2012) and 
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CANDELS-COSMOS (Lee et al. 2012). These surveys 
have the advantage of very high sensitivity to emission 
lines (^lxlCP 17 ergs s -1 ), but suffer from an inability to 
map outside their narrow redshift ranges, making them 
unfeasible for BAO studies. In addition there is the dan- 
ger of significant contamination from emission lines at 
different rcdshifts, a well known issue from high redshift 
Lya emission line searches (Martin et al. 2008, Henry et 
al. 2012). For instance, the typical Ha/[Om] 5007 A ra- 
tio decreases with redshift (Ly et al. 2007, van der Wei 
et al. 2011, Dommguez et al. 2013). A carefully con- 
structed experiment, with multiple photometric bands 
along with additional narrow bands targeting another 
expected line at the same redshift, can increase the nar- 
row band reliability (e.g., Lee et al. 2012, Sobral et al. 
2012), although surveys dependent on continuum detec- 
tions will still miss the lowest mass galaxies. 

Of course the best way to avoid the limitations of the 
atmosphere is to observe in space. Two future space mis- 
sions, ESA's Euclid (Laureijs et al. 2012, Laureijs et al. 
2011) and NASA's WFIRST (Green et al. 2012, Dressier 
et al. 2012), both utilize large area, near-infrared grism 
surveys to investigate BAO, galaxy evolution, and star 
formation history. In addition to identifying the redshift 
of galaxies, near-infrared spectroscopy also allows ac- 
cess to the wealth of stellar evolution diagnostic features 
available in the optical: Ha, one of the most reliable in- 
dicators of star formation rate, the Balmer decrement, to 
determine extinction, as well as multiple metallicity indi- 
cators. These emission line surveys will cover thousands 
of square degrees of continuous redshift space, unbiased 
by the underlying continuum luminosity of the galaxy. 
However, these will not be the first near-infrared grism 
observations done from space, as the Hubble Space Tele- 
scope has had two instruments with near-infrared grisms. 

The Near IR Camera and Multi-Object Spectrome- 
ter (NICMOS, Thompson & Schneider 1998) G141 grism 
pure-parallels program (McCarthy et al. 1999, Yan et al. 
1999, Shim et al. 2009), surveyed -170 arcmin 2 , iden- 
tifying 113 emitters, and measuring the Ha-luminosity 
function from 0.7< z <1.9. The NICMOS data had a 
tiny (< 1 arcmin 2 ) field of view, low spectral resolution 
(R~100), and relatively poor detector sensitivity. The 
Wide Field Camera 3 (WFC3; MacKenty et al. 2010) in- 
strument is an improvement in all three areas, providing 
a 20-fold survey efficiency gain for the presently on-going 
WFC3 Infrared Spectroscopic Parallels program (WISP; 
Atek et. al 2010, 2011). While much smaller in area, 
the WISP survey is much deeper than the future space 
near-infrared grism surveys. There is presently no better 
laboratory for predicting what these future missions can 
expect. 

In this paper we discuss a sample of the WISP emission 
line objects identified from 0.85 - 1.65 /im. We discuss 
the survey, details of the emission line extraction, com- 
pleteness corrections, and estimates of redshift accuracy. 
We then present the Ha and [Oni] line-emitter number 
counts, Ha/[0m] ratios, and luminosity functions. Fi- 
nally, we present our final summary and conclusions. We 
assume an f2M=0.3, Ha=0-7 universe with H o =70 km 
s -1 Mpc -1 . All magnitudes are in the AB system. 

2. WISP SURVEY 



The WISP survey (PI=Malkan, GO-11696, GO-12283, 
GO-12568, & GO-12902) consists of HST WFC3 (Kim- 
ble et al. 2008) grism observations in uncorrelated high- 
latitude fields obtained in parallel mode while prime ob- 
servations are being obtained with the Cosmic Origins 
Spectrograph (COS; Osterman et al. 2011) or the Space 
Telescope Imager and Spectrograph (STIS; Woodgate et 
al. 1998). The parallel data include both grism spec- 
troscopy and associated near-IR and optical imaging. 
Depending on the length of the parallel opportunity, the 
WISP survey cither acquires spectroscopy with just the 
G141 (1.2-1.7 /im, R^130) grism or, in the case of longer 
opportunities, a combination of G141 and the G102 (0.8- 
1.2 fjxa, R~210) grism with roughly 2-3 times more inte- 
gration time spent on the higher resolution G102 grism. 
At a plate scale of 0.13 arcsec pixel -1 , the total field of 
view for each observation is 123" x 136". We note that 
one can only achieve the full grism wavelength resolution 
for compact sources, as any spatial extension of the ob- 
ject will also broaden the features within the spectrum. 
See Atek et al. (2010) for further discussion of the pro- 
gram. 

For this paper we present data from 29 separate fields 
where we have both G102 and G141 grism spectroscopy, 
covering a total of 135 arcmin 2 over 159 orbits. These 
fields, along with their integration times, are presented 
in Table 1. While other large G141 grism surveys have 
been approved by HST (e.g., 3D-HST and AGHAST; 
Brammer et al. 2012, Weiner 2012), no other program 
comes close to surveying the area that WISP does with 
such an extended spectral coverage (0.85-1.65 /im). 

The inclusion of the G102 grism doubles the wave- 
length and redshift range surveyed. More importantly, it 
provides both critical checks of the assumed redshift and 
multiple line ratios that can not be obtained using G141 
alone. [Oni] and Ha only fall together on the G141 grism 
over a very narrow redshift range (z=1.3-1.55), otherwise 
in the vast majority of cases G141-only observations only 
discover single emission lines. [Oni] lines of compara- 
ble or greater strength than Ha are common (Hu et al. 
2009, Atek et al. 2010, Dommguez et al. 2013), making 
a catastrophic misidentification likely. With both G102 
and G141, WISP is sensitive to Ha, [Om], H/3, and [Oil] 
over a wide range of redshifts, and for z=1.3-1.55, we 
are sensitive to all those lines simultaneously. 

3. EMISSION LINE EXTRACTION 

All the data were processed with the WFC3 pipeline 
CALWF3 (ver. 2.1) to correct for bias, dark, flat-field, 
and gain variations. Then, the slitless extraction package 
aXe 2.0 (Kiimmel et al. 2009) is used for the spectral 
extraction. A complete description of the data reduction 
steps is presented in Atek et al. 2010. 

To locate and identify all the emission lines within our 
spectra we applied an automatic line finding algorithm. 
We first fit a three-segment cubic spline to the contin- 
uum of our one-dimensional aXe-extracted spectra, using 
outlier rejection to avoid fitting any emission lines. We 
then subtract the estimated continuum from the spec- 
trum and divide the continuum-subtracted spectrum by 
the aXe estimate of the flux uncertainties for each pixel. 
This produces a signal-to-noise spectrum. We look for 
groupings of three or more contiguous pixels with an ex- 
cess above the continuum with signal-to-noise in each 
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TABLE 1 

Summary of WISP Field Observations 



Field 


RA 


Dec 


G102 


F110W 


G141 


F160W 




[HMS] 


[DMS] 


[sec] 


[sec] 


[sec] 


[sec] 


Parl7 


02:13:38.11 


+12:54:59.3 


3409 


534 


3409 


559 a 


Par55 


12:20:54.68 


-02:04:46.0 


6415 


909 


2809 


484 a 


Par62 


13:01:16.20 


-00:00:20.2 


4712 


734 


2006 


396 


Par64 


14:37:29.04 


-01:49:49.5 


5918 


1112 


2306 


456 


Par68 


23:33:33.04 


+39:21:20.5 


7721 


1215 


3009 


534 


Par69 


15:24:07.75 


+09:54:53.9 


5721 


1087 


2309 


431 


Par73 


14:05:12.86 


+46:59:19.9 


6118 


1034 


2509 


456 


Par74 


09:10:48.14 


+10:17:20.3 


5918 


1065 


2306 


431 


Par76 


13:27:22.17 


+44:30:39.3 


5515 


887 


2006 


406 


Par78 


23:28 34.06 


+05:10:28.3 


5318 


887 


2106 


406 


Par79 


01:10:08.96 


-02:25:16.2 


7521 


1187 


2809 


534 


Par81 


01:10:09.12 


-02:22:17.1 


7521 


1187 


2809 


534 


Par87 


09:46:46.39 


+47:14:58.2 


4915 


912 


1906 


406 


Par94 


22:05:26.66 


-00:17:48.5 


9024 


1624 


3309 


534 


Par96 


02:09:24.40 


-04:43:41.6 


28081 


4295 


11430 


1765 


Par97 


01:10:06.30 


-02:23:44.7 


5515 


859 


2109 


406 


Parll4 


10:40:58.09 


+06:07:31.0 


7221 


1137 


2909 


456 


Parll5 


ll.lO.JJ.UO 


-1-09-1 7-OQ (\ 


5215 


912 


2106 


381 


Parl20 


13:56:51.50 


+17:02:33.9 


4512 


837 


1806 


381 


Parl24 


18:32:28.28 


+53:44:50.9 


4618 


759 


1906 


406 


Parl29 


11:02:18.72 


+20:52:07.8 


4712 


762 


2206 


456 


Parl31 


10:48:22.94 


+13:03:50.5 


13039 


2171 


5215 


884 


Parl32 


11:26:19.80 


-01:43:22.1 


4315 


634 


1806 


356 


Parl35 


11:22:24.01 


+57:50:58.9 


4712 


862 


1906 


406 


Parl36 


12:26:28.84 


+05:23:02.9 


18857 


3036 


7318 


1137 


Parl43 


14:02:22.01 


+09:45:51.7 


10133 


1568 


4012 


759 


Par 146 


02:12:27.60 


-07:32:20.7 


4212 


887 


1706 


381 


Parl47 


23:58:19.51 


-10:15:04.6 


5418 


962 


2106 


406 


Parl67 


01:41:24.18 


+13:37:34.1 


4315 


659 


1806 


356 


a Used F140W as 


the direct imaj 


;e used for identifying counterparts 


in the G141 


grism. 







pixel greater than For an unresolved object, the 

emission lines can be as narrow as two pixels and so we 
adopt a more-stringent two-pixel criterion of two con- 
tiguous pixels with a signal-to-noise ratio of greater than 
V5 in each. To validate this method of finding candidate 
emission lines, we compared several samples of emission 
lines found automatically to those found by an intensive, 
spectra by spectra, visual inspection and found the au- 
tomated method did not miss any significant fraction of 
lines (<5%). 

The parallel nature of the WISP observations presents 
several challenges to emission line extraction. Foremost 
is the lack of dithering, which greatly complicates the 
mitigation of cosmic rays, hot or warm pixels, and other 
artifacts. As a practical matter, this means that the au- 
tomated line identification process has a very high con- 
tamination rate from false and/or spurious sources. 

To address the false detections we require the visual 
inspection of every candidate emission line by two team 
members, done independently and without consultation. 
After this initial inspection, we compare the emission line 
lists from the two reviewers and send each reviewer back 
a list of discrepancies, allowing them to make a second, 
more careful examination of any emission lines for which 
there is a disagreement. This exercise provides a good 
final agreement for most emission lines, but we find that 
for roughly 5% of the emission lines, consensus can not 
be reached. These uncertain lines are removed from the 
final sample, but are accounted for in our completeness 
corrections (see Section l3~Tj) . 

The visual inspection consists of examining both the 
extracted one dimensional and the original two dimen- 
sional spectra, as well as the continuum fits and measure- 



ments of signal-to-noise. There are only four conditions 
for which we exclude a candidate line: 

1) If the line is clearly an artifact, either a cosmic ray, 
a zero order image from a nearby bright source, a diffrac- 
tion spike, ghost, or some other artifact clearly not asso- 
ciated with the spectrum in question. 

2) If the continuum fit under the location of the emis- 
sion line is highly inaccurate, artificially increasing the 
significance of the emission line. Many of the spectra 
contain breaks, both real and caused by nearby contami- 
nation from overlapping sources, which often lead to con- 
tinuum errors of this sort. 

3) If the local noise variations are larger than the flux 
uncertainties used by the automated line identification 
software, measuring lines at a higher significance than 
local noise conditions actually warranted. While the 
WFC3 detector and grism are fairly uniform and well- 
behaved, we find variations across the detector and lo- 
cations where what appear to be non-Poissonian sources 
of noise exist. In these cases the automated recovery 
pipeline was likely to find every noise peak and call it a 
source. 

4) If the contamination from nearby overlapping spec- 
tra is so great that we can not determine what source 
is producing the emission line. However, in most cases 
even severe overlapping source contamination can be un- 
tangled. We use multiple lines spread across the G102 
and G141 grisms (only the real source produces the cor- 
rect wavelength ratios in those cases), small pixel offsets 
from source center, and discrepancies between the size 
of the object and the spatial extent of the emission line 
as evidence for assigning the confused emission line to 
its proper source. Only if the source remains ambiguous 
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Fig. 1. — Completeness of extracted emission lines as a function of: left) line flux signal-to-noise, middle) equivalent width, and right) 
radius in arcseconds. The dotted line in the middle equivalent width plot represents the chosen equivalent width cut. 



after all analysis is it excluded from the final sample. 

Single emission lines are assumed to be Ha, except in 
the few rare circumstances where the single line is clearly 
resolved into the [Om] 5007+4959 A doublet. Our simu- 
lations indicate that this is a good assumption and does 
not result in more than 6% of our Ha lines being misiden- 
tified as [Om] (see Section [3.31 below). However, this 
assumption does have a large effect on the recovery of 
high redshift (z>f.5) [Om] lines, requiring some addi- 
tional completeness corrections for those emission lines 
(see Section [37TTTJ) . 

In total, we extracted 1960 emission lines from the 29 
fields down to a typical flux limit of 3-5xl0~ 17 ergs s _1 
cm . For an emission line galaxy to be included in 
our final analysis we also require an observed equivalent 
wid th greater than 40 A (log EW = 1.6; see Section 
13.11 below) and a signal-to-noise greater than 5 for at 
least one of the detected emission lines, creating a final 
sample size of 1048 galaxies. Of these, 467 have multiple 
detected emission lines, including weaker [On], H/3, and 
SII. For our final sample of emission line galaxies, we 
detect Ha in 996 galaxies and [Om] in 280 galaxies. The 
overlap where we found both Ha and [Om] in the same 
galaxy is 228. 

3.1. Completeness Corrections 

Virtually all surveys suffer incompleteness from objects 
lost as the strength of their signals approaches the level of 
the noise. A slitless grism survey, like WISP, also suffers 
significant incompletenesss due to confusion from nearby 
bright sources. In addition to those two fundamental 
sources of incompleteness, emission lines can also be lost 
as part of the extraction methodology. These sorts of fail- 
ures include objects missed by the automatic line finding 
routine, objects removed due to redshift confusion, and 
objects incorrectly rejected by visual inspection. In or- 
der to derive final completeness corrections to account 
for all the forms of incompleteness that effect the grism 
data, we must simulate the entire line extraction process 
from beginning to end. 

We start with the generation of a set of emission 
line galaxy models, covering the full range of redshifts 
(z=0.3-2), brightnesses (19-26 F110W mag), spatial ex- 



tent (0.2-l"radius), equivalent widths (20-2000 A) and 
line fluxes (10 -17 - 10 -15 ergs s _1 cm -2 ) found in our 
WISP data set of emission line galaxies. We then place 
each model galaxy randomly into one of our actual pairs 
of WISP G102 and G141 grism images using the aXeSIM 
software (Kiimmel et al. 2007), before extracting it using 
the same methodology and pipeline as used for extract- 
ing all the real emission line galaxies. This means that 
first we identify the lines using the automated software 
and then two separate team members visually examine 
each spectrum to ensure it contains a real emission line. 
During the visual inspection we mix 15 model galaxies 
along with another 15 randomly chosen galaxies, such 
that the reviewer could not assume that each spectrum 
must contain a visible emission line. Overall we insert 
923 model galaxies (74 fields each with typically 10-15 
model galaxies) and, including the random galaxies, ex- 
tract twice that number. 

We find that the primary determinants of complete- 
ness for the emission line galaxy sample are the observed 
equivalent width (EW) and the line flux of the emission 
line. In practice, the latter translates to the signal-to- 
noise of the emission line flux measurement, as the back- 
ground noise varied from field to field by a factor of 4. 
We also find completeness had some dependency on the 
radius (defined as the major and minor axis averaged 
in quadrature) of the source galaxy, but that this is of 
secondary importance due to relatively small number of 
large (> 0.75") radius galaxies in the observed sample, 
approximately 0.5% . In Figure 1, we plot completeness 
as a function of these three parameters (line flux S/N, 
EW, and size), where each bin is a combination of re- 
covery rates for the model galaxies. We weight all the 
model galaxy recovery rates for each single parameter 
by the frequency in which galaxies with the other two 
parameters appear in our full sample of actual emission 
line galaxies. For instance, if the full sample has twice 
as many small EW objects as large EW objects, model 
galaxies with small EWs will be given twice the weight 
when deriving completeness for S/N or size. 

One immediate result from this simulation is that the 
recovery rate drops rapidly from 60% to <20% as the 
observed EW (A) of the emission line becomes less than 
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Fig. 2. — Plot of emission line flux signal-to-noise versus equivalent width for all the identified WISP emission lines. Red triangles are 
objects with Ha emission lines, green squares are objects with [Oiii] emission, and blue stars are objects where both lines are detected. 
Where both Ha and [Om] are detected, the parameters plotted are sums (EW) or quadratic sums (S/N) of the two lines together. Over- 
plotted as solid black circles are the model emission line galaxies used for our completeness simulations. The dashed lines are the limits in 
flux and EW applied to our sample. Emission lines that lie beyond those limits have been excluded from the final analysis. 



40 A (logEW =1.6; see Figure 1). In the rest frame of 
our z^l galaxies, this EW limit corresponds to 20 A, 
roughly two thirds that of the average Ha EW for spiral 
galaxies in the Local Volume (Lee et al. 2007). In the 
interest of not introducing extremely large (> 5) and un- 
certain completeness corrections to the number counts, 
we apply a cut of EW > 40 A to all of our analysis. Af- 
ter combining this EW cut with our requirement that all 
detected emission lines have a signal-to-noise > 5, the 
total number of model galaxies is 553. Of those model 
galaxies, we recovered 380, yielding an approximate re- 
covery rate of 69%. We note that for this combination of 
equivalent width and signal-to-noise, there are effectively 
no galaxies fainter than 25 magnitudes (as measured at 
FllOW-band, although the simulated galaxies are flat in 
F„) in the final sample we use for our analysis. 

In Figure 2, we present emission line EW as a function 
of signal-to-noise for both observed and model emission- 
line galaxies. One can see that the model galaxies span 
the same parameter range as the real galaxies, extend- 
ing farther to brighter lines with large EWs and fainter 
lines with smaller EWs in order to test how our sample 



completeness acts at these extremes. One complication 
is that over a significant portion of the redshift range 
(z=0.7-1.5), there are actually two strong emission lines, 
[Oiii] and Ha. In some cases [Oiii] is the more power- 
ful line, particularly as we go to fainter line fluxes (see 
Section 1431 below) . We therefore used model templates 
both where Ha is more powerful and ones where [Oiii] is 
the stronger emission line (Ha/[Om] = 0.75 - 4.3). 

One would expect a source with multiple lines to have 
a higher completeness than a single line source with the 
same line flux and EW. Therefore to determine the like- 
lihood of identification, we measure the combined signal- 
to-noise of both lines, add them quadratically, and use 
that total signal-to-noise for calculating our complete- 
ness. For the combined EW we use a simple sum of 
the two emission lines. We plot these combined val- 
ues of signal-to-noise and EW as blue stars in Figure 2, 
along with the single line cases of Ha (red triangles) and 
[Oiii] (green squares). In most cases one line is clearly 
stronger, so the final signal-to-noise used for the com- 
pleteness correction can usually be approximated as the 
signal-to-noise of just the brightest emission line. We do 
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Fig. 3. — Diagram of completeness as a function of both line 
flux signal-to-noise and equivalent width. Each box represents one 
of the two parameter bins used, with the measured completeness 
ratio number inside it, including its error. The larger the square 
box, the smaller the completeness corrections that were applied for 
those emission line galaxies. 



not use the other, weaker emission lines ([Oil], SII, H/3, 
etc.) for determining completeness, as these lines are 
never alone and their relative fluxes are almost always 
small compared to Ha and [Om] . 

We find that for even the highest signal-to-noise lines 
(S/N > 100), we reach only 90-95% completeness. The 
main reason for this is contamination, either from over- 
lapping spectra or large, bright objects in the direct im- 
ages used to identify the galaxy counterparts. Either the 
line is lost under an extremely bright spectrum (most 
fields have 3-5 moderately bright H<19 objects), the ob- 
ject lies in a complicated region where we are unable to 
determine which of the nearby, overlapping objects to 
associate with the emission line, or the contamination 
from overlapping sources has caused the automatic line 
finder to fail, i.e., Sextractor fails to find the source or 
the continuum fit fails badly. Altogether contamination 
from nearby sources explains about 75% of line extrac- 
tion failures. Most nearby source contamination issues 
would be resolved if data could be taken at two or more 
roll angles, such that the spectra do not overlap, but 
that is not an option for parallel programs. Another rea- 
son a strong line will fall out of our sample is if the two 
emission line reviewers can not come to consensus on the 
redshift of the object (10% of the bright extraction fail- 
ures), which most commonly occurs where it is unclear 
whether the line is Ha or [Om]. For instance, when a 
source is spatially extended the grism wavelength reso- 
lution is effectively lowered, making it difficult to distin- 
guish between the [Om] doublet and a single emission 
line. This sort of confusion almost never occurs in G102 
(R~210), suggesting that data taken with a higher spec- 
tral resolution would largely solve this problem as well. 
The remaining ~15% of bright emission line failures are 
mostly miscellaneous extraction errors by the visual line 
inspectors. 

In Figure 3, we plot the final completeness corrections 
used for all of our analysis, derived as a function of both 
the equivalent width and the signal-to-noise of the line 



flux. While the object size also has an effect on whether 
an emission line galaxy will be recovered, we found that 
accounting for galaxy size has little effect on the overall 
completeness corrections, as the largest galaxies make 
up only a tiny percentage of the sample, <2%, even af- 
ter accounting for their higher incompleteness. Or to 
put it another way, galaxy size is not very important 
to the overall completeness correction, because size com- 
pleteness corrections grow larger as the actual number of 
galaxies is decreasing, unlike EW or line flux, where the 
number of objects tends to grow just as their complete- 
ness corrections are significantly increasing. A three pa- 
rameter completeness correction is also not practical for 
the required double visual inspection of each suspected 
emission line. Each added parameter increases the hu- 
man workload geometrically. Instead, for each bin of EW 
and line flux signal-to-noise, we weight the input model 
galaxies by the frequency that each galaxy size appears 
in our observed sample list of emission line galaxies. 

3.1.1. [OIIIJ Completeness 

Generally we found that the completeness corrections 
we derived for Ha can also be applied to [Om] emission 
lines. While [Om] is a doublet with a distinctive line 
profile that one might expect could be more easily disin- 
guished from artifacts and noise, for low signal-to-noise 
and small EWs where the incompleteness becomes in- 
creasing important, [Om] becomes indistiguishable from 
a single emission line. 

This assumption that Ha and [Om] completeness can 
be treated the same breaks down for [Om] at redshifts 
of z >1.5. This is a result of the attribution of all single 
line emitters to Ha. Below z=1.5, Ha remains in the 
wavelength range that is detectable by the WFC3 grisms 
and in the vast majority of cases if we detect an [Om] 
emission line, the Ha emission line will be detectable as 
well. This is because even down to our faintest emission 
line detections, the Ha/[Oin] ratio is usually greater than 
1 and rarely less than 0.4 (see Section l4~2j) . Above z=1.5, 
the Ha emission line is redshifted out of our detectable 
wavelength range, so we lose the ability to confirm [Om] 
with the usually brighter Ha line. If the [Om] line is 
bright enough we can still identify it from its doublet 
line profile, but the faint [Om] lines will all be identified 
as lower redshift Ha lines by default. 

While this misidentification of high-z [Om] produces 
only a small contamination of Ha (see Section 13. 3[) , our 
simulations show we lose nearly 50% of our high-z [Om] 
emission lines because of our inability to resolve the dou- 
blet. More specifically, the fainter the emission lines the 
larger the incompleteness. We have therefore added an 
additional completeness correction to our z>1.5 [Om] 
emission lines of 2.0 for 0.6<log S/N<1.2 and 1.6 for 
1.2<log S/N<1.8. Higher signal-to-noise [Om] emission 
lines require no additional correction. 

3.2. Redshift Accuracy 

Using the simulations we can also get an estimate of 
the redshift accuracy which is possible with grism spec- 
troscopy of this wavelength resolution. Each emission 
line source is placed at a random position in the field, 
which introduces the same uncertainties on the wave- 
length solution as seen in real data, including unknown 
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Fig. 4. — Histogram of redshift accuracy for Ha- emitters taken 
from the simulations, where the accuracy is defined as {z ^ af , rve d - 
Zinput)/(l-+Zi n p U t). The solid black histogram is all Ha emitters, 
the dashed red histogram is those emitters found only in the G102 
grism, and the dot-dashed blue histogram is the emitters found 
only in the G141 grism. 

absolute positions and inexact object sizes. While this is 
not a measurement of the absolute wavelength accuracy 
- the model spectra are put in and taken out with the 
same software (aXe) and the same assumed wavelength 
solution - it is a reasonable test of the accuracy one can 
expect with this pixel sampling, extraction method, and 
signal-to-noise. 

We removed the emission lines flagged because the re- 
viewers could not agree on a redshift and then applied 
our S/N cut of 5 and EW cut of 40 A. We found that 
going to even higher S/N and larger EW made almost no 
difference in the redshift accuracies we derived. For this 
analysis we are not interested in the catastrophic errors, 
which represent contamination of false lines in our sam- 
ples rather than a real reflection of redshift accuracy, so 
we removed any objects with errors in 1+z > 0.5%. We 
discuss issues of contamination of false sources in Section 
13.31 We further note that there is an error in the aXe 
process of inserting and removing user-generated spectra 
that incorrectly shifts the output wavelength calibration 
by a single pixel, corresponding to 24 A in G102 and 
46 A in G141, which we have removed from all further 
analysis. 

In Figure 4, the histogram of (z observed - z input )/(l + 
Zinput) shows that the results are strongly peaked around 
zero. The median offset is -0.0002 with a standard de- 
viation of 0.0017, or roughly 0.2% accuracy in redshift. 
Modeling of baryonic acoustic oscillations suggests that 
the redshift accuracies required for such an experiment 
are more like 0.1% (Wang et al. 2010), which is also 
the requirement for both the Euclid and WFIRST mis- 
sions. However, it is important to recall that the WISP 
redshifts are a combination of two different near-infrared 
grisms (G102 and G141) with two different resolutions 
(R~210 and R/^130). If we split the redshift accuracy 
histograms by the grism from which it came, we find 
the higher resolution grism reaches accuracies of 0.13% 
(as opposed to 0.19% in G141), demonstrating that 0.1% 
redshift accuracy is possible if the wavelength resolution 



R > 200. 

It is important to note that the lack of sub-pixel dither- 
ing of the parallel observations prevents any meaningful 
resampling of pixels along the direction of the wavelength 
dispersion. Such resampling would provide a small im- 
provement in the overall wavelength resolution and would 
therefore also potentially increase the redshift accuracies 
that are possible. 

3.3. False Emission Line Contamination 

Identifying emission lines in grism spectroscopy fields 
without dithering or field rotation is a challenging task. 
Even the well-behaved WFC3 near-infrared arrays still 
have cosmic rays, hot pixels, and other phenomena that 
can mimic real emission lines. Therefore as we approach 
the detection threshold there is a real chance of contam- 
ination of our Ha line sample by false emission lines. To 
investigate this further we look again to the simulation 
data. For the completeness calculation, we track whether 
an emission line galaxy is detected and placed into our 
final emission line list. There is no requirement that in- 
put and output redshifts must match. A mismatched 
redshift indicates contamination. While an emission line 
was found, it is clearly not identical to the emission line 
that was input. We label all extracted simulated spec- 
tra with a difference between input and output (1+z) of 
more than 1% as contamination. 

Of the 3643 simulated Ha emission lines that meet all 
extraction criteria (S/N > 5, EW > 40 A), 312 are good 
emission lines found at the input redshift, a successful ex- 
traction rate of 85.7%. An additional 21 emission lines 
are at the correct input redshift if we assign them to be 
[Oiii] rather than Ha, for a contamination of Ha emission 
lines by [Oiii] of 5.8%. For the remaining lines there are 
no alternative emission lines with which they could be as- 
sociated, so they are contamination of non-emission lines 
to the sample. That gives us two contamination rates for 
our sample: 8.5% contamination by false emission lines 
and 14.3% total contamination rate of Ha emission lines 
by the combination of [Oiii] and false emission lines. 

This false emission line contamination rate decreases 
slightly as we increase the cut-offs of either equivalent 
width or signal-to-noise of the line fluxes, but contami- 
nation rate never improves to much less than 10%. For 
instance, whether we raise the equivalent width cut-off 
from 40 A to 200 A or raise the the signal-to-noise line 
flux cut-off from 5 to 20, the false emission line contam- 
ination rate drops by only 3.5%, to 10.8% total contam- 
ination. 

We note that one other potential contaminant is [On] 
3727 A. [On] emission is generally faint, with fluxes 
around 40% those of [Oiii] 5007+4959 A. For redshifts 
where [On] and [Oiii] both fall into the detectable near- 
infrared wavelength range (1.3<z<2.3), if we detect 
[On], we always detect [Oiii] as well. There are some rare 
cases where [On] 3727 A can be as much as 30% brighter 
than [Oiii], but that [Oii]/[Oiii] ratio is not large enough 
to make any significant number of [Oiii] emitters unde- 
tectable. Therefore we do not expect any contamination 

1 This number is slightl y les s than the 380 simulated emission 
lines referred to in Section 13.11 as it does not contain 16 weak Ha 
emitters that were found only because of their [Oiii] emission. 
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Fig. 5. — Histogram of all detected Ha line emitters as a function 
of top) line flux, middle) observed equivalent width, and bottom) 
redshift. The dashed blue lines are the raw, uncorrected counts, 
while the solid red lines are the counts after the correction for 
completeness. 

from [Oil] 3727 A below a redshift of z<2.3. However, 
the assumption that all single emission lines are Ha could 
potentially lead us to miss a population of very high red- 
shift (z>2.3) [Oil] emitters. These high redshift [Oll]- 
emitters would have to be very luminous, i.e., more than 
5 times the L* found for [On] at 2=1.47 (Ly et al. 2007), 
and should therefore be quite rare. By z=2, the number 
density of [On]-emitters has already dropped below 500 
deg -2 , suggesting it is unlikely there could be more than 
3-5 z>2.3 [On]-emitters hidden in our Ha sample. 

We do not make any separate corrections for these 
false emission line contamination rates in our analysis. 
The completeness corrections we apply are based on all 
the extracted emission lines from the simulations, includ- 
ing the contaminating lines. While the contaminating 
lines artificially inflate the number of model emission line 
galaxies recovered, which reduces the derived complete- 
ness corrections, we expect the same contamination rates 
in the actual data. If we used completeness corrections 
that did not include contaminating lines, we would incor- 
rectly produce final number counts that were too large. 
By leaving the contaminating lines in our completeness 
derivation, we produce completeness corrections that si- 



FlG. 6. — Histogram of all detected [OIII] line emitters as a func- 
tion of top) line flux, middle) observed equivalent width, and bot- 
tom) redshift. The dashed blue lines are the raw, uncorrected 
counts, while the solid red lines are the counts after the correction 
for completeness. We note that the faintest [OIII] flux bins consist 
mostly of secondary, fainter lines that have only been identified 
and included because they are companions to brighter Ha emis- 
sion lines. We make no attempt to correct for this incompleteness, 
choosing instead to restrict our analysis to the brighter lines. 

multaneously account for the effects of both incomplete- 
ness and contaminating lines. To put it another way, the 
completeness corrections we use are ~ 15% smaller than 
they would otherwise be because of the extra emission 
lines we added due to contamination. 

4. EMISSION LINE NUMBER COUNTS 

The most numerous emission line seen by far is that 
of Ha, which makes up two thirds of all those observed. 
Of the 996 Ha emission lines with EWs > 40 A and 
S/N > 5 , 577 (58%) are identified from a single line. 
While identifying Ha emission from a single line intro- 
duces the possibility of a misidentification, our simula- 
tions (see Section 3.3) demonstrate that this affects no 
more than 6% of our emission line galaxies. Since vir- 
tually all the misidentifications are single emission lines, 
this implies that roughly 10% of our single Ha emission 
lines are actually not Ha at all, but instead misidentified 
[Oiii]. While far from negligible, the level of [Om] emis- 
sion line contamination down to the flux levels probed 
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by this work (3-5xl0 -17 ergs s" 1 cm" 2 ) remains small 
enough (10% of single emission lines, 6% of all lines) that 
it can be easily accounted for in the statistical analysis. 

In Figures [5] and [6j we plot the sky density for Ha and 
[Oni] versus flux, equivalent width, and redshift in both 
the original raw counts and those corrected for complete- 
ness. Here we can see that there are significant complete- 
ness corrections applied across the entire range of fluxes. 

4.1. Emission Line Number Densities 

To derive the number density of the Ha emission lines 
in our survey, we employed the Vmax method (Felton 
1977), in which we derive the maximum distance for 
which an emission line of that absolute luminosity could 
be detected, depending on the flux limits of each field, 
and then translate that to a volume, Vmax ■ This can be 
translated into a density: 

*** = ^ E °( SNR > EW )ir— M 

" ' Vmax 

where C(SNR,EW) is the completeness correction for 
each detected emission line galaxy, as a function of signal- 
to-noise and equivalent width. We scale the volume of 
the entire sphere of the sky by the angular area observed, 
47r/f2, to reach the volume density for each object, 8 ga i. 
Then we add up all the 8 ga i to get the total volume den- 
sities, which we then use for derivation of the luminosity 
functions (see Section 4.3). 

Similarly, we also calculate area number densities, de- 
riving an Am ax for each source. Am ax is the total an- 
gular area from the survey for which the emission line flux 
is greater than that of the 5tr line limit of the field. For 
the brighter emission lines, Am ax will simply equal the 
angular area of the entire survey (29 fields in this case). 
Similar to the Vmax method, once the Am ax's are de- 
termined we add up all the 1 / Am ax values to determine 
the area number density. For calculation of both Vmax 
and Am ax , the detector area used is roughly two thirds 
(<~ 3 arcmin 2 ) of the total area of the WFC3 IR chip 
(4.65 arcmin 2 ), as objects are lost to the edges where ei- 
ther the 1st order spectrum falls off the chip (right side) 
or there are no direct images of galaxies available (left 
side). 

The primary surveys of the future near-infrared grism 
space missions, Euclid (Laureijs et al. 2012) and 
WFIRST (Green et al. 2012, Dressier et al. 2012), will 
both cover thousands of square degrees to depths signif- 
icantly shallower than the WISP program. The Euclid 
wide spectroscopic survey will cover 15,000 square de- 
grees over a wavelength range of 1.1-2 /im, with a reso- 
lution of R=250 and a 3.5 a line flux depth of 3xl0" 16 
ergs s _1 cm" 2 . At time of publication, the WFIRST mis- 
sion is still considering several competing designs, but the 
primary spectroscopic mission will likely cover at least 
2000 square degrees at a resolution of R=600, down to 
a 7 a line flux depth of lxlO" 16 ergs s _1 cm" 2 . The 
wavelength range for WFIRST will likely start between 
1.3 and 1.5 fim and end somewhere between 2 and 2.4 
fim. While this paper focuses on comparisons and pre- 
dictions for the future shallow wide surveys of Euclid and 
WFIRST, it should be noted that both missions will cer- 
tainly contain deeper surveys done over less area. For 
instance, the Euclid mission plan contains at least two 
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Fig. 7. — Cumulative number count density as a function of 
Ha line flux. The red squares and dotted-dashed line are all the 
0.3<2<1.5 Ha emitters, the blue circles and solid line are just 
Ha emitters between 0.7<z<1.5, while the green triangles and 
dashed line are Ha emitters with [Oin] line emission greater than 
1.7xl0 — 16 ergs s — 1 cm -2 (approximately la for the planned Eu- 
clid line flux limits). The dotted black line is the number count 
density prediction of Geach et al. (2010), divided by ln(10). 

20 square degree deep fields with planned depths very 
similar to the observations for the WISP program, with 
line flux limits around 5xl0" 17 ergs s" 1 cm" 2 . 

In Figure [71 we plot cumulative number counts ver- 
sus limiting flux for the WISP survey, covering the pri- 
mary range of interest (1-lOxlO -16 ergs s" 1 cm" 2 ) of 
these future surveys. Completeness corrections have 
been applied to all points. We present three cumulative 
number measurements: total Ha emitters (red squares), 
0.7<z<1.5 Ha emitters (blue circles), and 0.7<z<1.5 Ha 
emitters for which [Oin] is also detected (green trian- 
gles). At the bright flux end (10 -15 ergs s _1 cm -2 ) 
we find a factor of 1.5 times more Ha emission lines at 
0.3<z<0.7 than at 0.7<z<1.5. As we approach fainter 
fluxes the percentage of high redshift objects increases, 
with the number of 0.7<z<1.5 sources exceeding those 
at 0.3<z<0.7 around 2xl0" 16 ergs s" 1 cm" 2 . 

The WISP survey reaches shorter wavelengths (0.85 
/zm) than presently planned for these future space mis- 
sions. The Euclid short wavelength limit, 1.1 /xm, 
translates into an Ha redshift of z=0.7, so we use the 
0.7<z<1.5 cumulative number counts to predict what 
these future space missions will find, noting that the 
WFIRST number counts will be somewhat lower depend- 
ing on the exact low wavelength cut-off used. The 1.65 
/jm WISP cut-off is a shorter long wavelength cut-off 
than the 2 \xm or better planned for these future mis- 
sions, but that only means the WISP survey is not sen- 
sitive to the highest redshift Ha emitters. Because of 
the steep fall-off in number counts with redshift for the 
flux limits being planned, it is unlikely the cumulative 
Ha number counts presented will be much different even 
with longer wavelength sensitivity. 

For comparison to our 0.7<z<1.5 cumulative number 
counts, we plot the 0.75<z<1.9 number counts derived 
from the study of Geach et al. (2010), which is mostly 
based on HST NICMOS grism and near-IR narrow band 
number counts (Shim et al. 2009, Geach et al. 2008, 
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Fig. 8. — Cumulative number count density as a function of Ha 
line flux split into different redshift bins from 0.7<z<1.5. The solid 
blue line is all emitters from 0.7<2<1.5 and is unchanged from 
Figure The dashed light blue line is 0.7<z<0.95, the dotted 
purple line is 0.95<z<1.2, and the dotted-dashed orange line is 
1.2<2<1.5. 
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Fig. 9. — Cumulative number count density as a function of [OIII] 
line flux, split into different redshift bins. The red squares and 
dotted-dashed line is all emitters from 0.7<2<2.3, the blue circles 
and solid line are the number density counts from 0.7<2<1.5, while 
the orange triangles and dashed line are 1.5<z<2.3. 

Sobral et al. 2009). We have decreased the result from 
the published article by a factor of ln(10) to account 
for an error in the published values of used, a result 
of improperly converting <£>(logL) luminosity functions 
to the more standard $(L) luminosity functions. Once 
we account for this offset, we find agreement between 
their NICMOS-derived cumulative distribution and our 
distribution from WISP at the faint end, although there 
is some significant deviation brighter than 5xl0~ 16 ergs 
s _1 cm -2 , where WISP detects roughly a factor of 2-2.5 
fewer sources. We find approximately 2500 (18,500) Ha 
emitters deg -2 in the redshift range 0.7<z<1.5 down to 
3 (1) x 10" 16 ergs s" 1 cm' 2 . 

To predict the number of multiple emission line (Ha + 
[Oin] ) sources a future near-infrared slitless spectroscopy 
space mission could expect to find per square degree, one 



must first choose a flux limit down to which [Oni] will 
also be detectable. This detection limit will vary de- 
pending on survey depth and secondary line reliability 
requirements. We can start with the presently planned 
Euclid mission 3.5c detection limit of 3xl0 -16 ergs s _1 
cm~ 2 . If we make the further assumption that one could 
reach a lower significance limit of 2a for a secondary line 
if one already had a higher confidence line in hand, then 
one could detect [Oni] flux down to 1.7 xlO -16 ergs s _1 
cm~ 2 . We apply this [Oni] flux limit to the green cumu- 
lative count line in Figure producing a prediction for 
Euclid for the number density of emission line galaxies 
where both Ha and [Oin] will be detected. Down to an 
Ha flux of 3xl0~ 16 ergs s _1 cm~ 2 we find that roughly 
24% of the 0.7<z<1.5 Ha-emitters are also detected in 
[Oin], or 600 Ha + [Oin] emitters per square degree. A 
higher limit on the [Oni] flux will obviously exclude more 
multiple emission line objects from the final counts. For 
instance, if we require the [Oni] and Ha line flux to both 
be greater than 3xl0~ 16 ergs s _1 cm -2 , then the density 
of Ha + [Oin] emitters drops by half, down to around 
280 per square degree. 

With 314 Ha emission lines at z>0.7 with fluxes above 
10~ 16 ergs s _1 cm~ 2 , we have enough emission lines 
to break down the cumulative number counts into even 
smaller redshift bins in Figure |5J This more detailed 
redshift breakdown allows us to examine the evolution of 
the number counts that will be available to these future 
near-infrared slitless spectrographs in space. At a flux 
of 10~ 15 ergs s _1 cm -2 the 0.7<z<1.5 Ha emission line 
galaxies are split roughly equally between 0.7<z<0.95 
and 0.95<z<1.2. However, by 6xl0 -16 ergs s _1 cm~ 2 , 
the emitters from the 0.95< z <1.2 redshift range out- 
number those from 0.7<z<0.95 by a factors of 1.5-2 and 
remain more numerous down to fainter fluxes. Our high- 
est redshift bin, 1.2<z<1.5, has no emission lines as 
bright as 10~ 15 ergs s _1 cm~ 2 and is not a significant 
percentage of the emission line sample until fluxes less 
than ^3xl0~ 16 ergs s _1 cm~ 2 . 

We can also look at the number densities of the [Oin] 
emission lines on their own in Figure [9j Over the same 
range of line fluxes, the number counts of [Oin] emit- 
ters initially rise much more slowly than the numbers of 
Ha-emitters, before rising quickly for fluxes fainter than 
3xl0~ 16 ergs s _1 cm~ 2 . At 10~ 15 ergs s _1 cm -2 , there 
are 3 times more Ha than [Oni] emission lines, but by 
6xl0~ 16 ergs s _1 cm -2 that number density difference 
has increased to a factor of 7. Then, thanks to the steep 
rise of the fainter [Oni]-emitters, this number density 
disparity rapidly shrinks and returns to a factor of 3 for 
line fluxes of 10~ 16 ergs s _1 cm~ 2 . This steep rise in 
the relative [Oni] number density can also be seen in the 
decreasing Ha/[OIII] ratio we measure for fainter line 
fluxes (see Section I4T21 below) . 

We find roughly 1.5 times as many 0.7<z<1.5 [Oin] 
emitters, where Ha can also be identified in the WISP 
data, as [Oin] emitters found at the higher 1.5<z<2.3 
redshifts. This is after accounting for the extra incom- 
pleteness [Oin] suffers at the higher redshift (see Sec- 
tion I3.1.ip . In the actual raw counts we find 3 times 
as many 0.7<z<1.5 [Oni] emitters. This number differ- 
ence between redshifts remains fairly constant over the 
l-5x 10~ 16 ergs s -1 cm~ 2 flux range. In the data we 
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see an unexpected crossover towards the brightest end, 
with the highest redshift emitters becoming the most 
populous. However, the number statistics here are quite 
poor, with only 9 total [Oin] emitters covering the 5-10 x 
10" 16 ergs s" 1 cm" 2 flux range, split into the two redshift 
bins. Within the error bars, we find the number counts 
are consistent with no significant change in the number 
ratio between the high and low redshift [Om] emitters 
over the entire flux range. This relatively steady num- 
ber count ratio across a wide range of redshifts suggests 
that the luminosity of the typical [Om] emitter (L*) is 
increasing strongly with redshift, something we examine 
further when we look at the [Om] luminosity function in 
Section H31 

4.2. Ha/ [OIII] Ratio 

Over the near-infrared wavelength range probed by the 
WISP survey, 0.85-1.65/zm, the vast majority of the emis- 
sion lines observed are either Ha or [OIII]. When other 
emission lines are identified they are always found at 
the same time as one of those two lines, largely because 
these two emission lines are almost always significantly 
stronger than the other available lines. Besides the pos- 
sibility of a rare [On] emitter (see Section I3~3l above) . our 
simulations suggest that the assumption that all single 
line emitters are Ha mainly results in the misidentifi- 
cation of [Om] emission. While these failed identifica- 
tions only effect 6% of Ha lines, it has a large effect on 
the high redshift (z>1.5) [OIII]-emitters, where we lose 
nearly 50% from our final sample. 

The future near-infrared grism space missions are likely 
to suffer similar single emission line misidentification is- 
sues. Increased wavelength resolution will certainly aid 
in identification, as a resolved [Om] 5007+4959 A dou- 
blet will not be confused with Ha. However, for grism 
spectroscopy wavelength resolution is not the only lim- 
iting factor. If the emitting region is large in spatial 
extent it will smear out the doublet, effectively lowering 
the available wavelength resolution. While the median 
galaxy radius of our sample is only 0.2", the future wide- 
area grism missions will reach shallower flux depths and 
therefore should find larger objects, as there is a strong 
correlation between line flux and galaxy size (see Figure 

hq}. 

Down to 3x 10 -16 ergs s -1 cm -2 and limiting our anal- 
ysis to redshifts of z >1.2 (where Ha /[Om] confusion 
becomes an issue at 1 . 1 /im) , we find the radius of the me- 
dian object increases 50% to 0.3". While the sample of 
objects with both high fluxes and high redshifts is small, 
a full 25% (4 of 16) have radii larger than 0.5". Fortu- 
nately the pixel scale for the Euclid mission will be 0.3" 
pixel -1 , more than twice that of the IR WFC3, which 
should greatly mitigate the degradation of wavelength 
resolution caused by extended sources. While the exact 
pixel scale for WFIRST detectors remains uncertain, it 
will likely be close to that of the WFC3 IR channel, 0.13" 
pixel -1 . However, the proposed WFIRST wavelength 
resolution (R=600) is large enough that the wavelength 
smearing should not prevent identification of the [Om] 
5007+4959 A doublet in most cases. 

To investigate the likelihood of single emission line con- 
fusion further, we plot the ratio of Ha/[Oni] as a function 
of Ha flux in Figure QTJ We limit our redshift range to 



□ 

□ 

□ 
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Fig. 10. — Flux versus radius for our entire sample of line emit- 
ters, EW > 40 A and S/N > 5. The flux given is Ha emission except 
in cases where Ho is not detected (mostly at z>1.5), in which case 
the flux presented is [Om] . Emission line galaxies with redshifts of 
2>1.2 are red triangles, while those with redshifts z<1.2 are blue 
squares. 




Ha flux (ergs s~ cm" 2 ) 



Fig. 11. — Ratio of Ha divided by [OIII] line flux versus Ha flux 
for all detected 0.7<z<1.5 line emitters. Solid black squares are 
sources in which both lines are detected. The empty blue circles 
are sources where only Ha is detected and are lower limits, while 
the red triangles are the two sources where only [OIII] has been 
detected, with its two arrows pointing towards the quadrant those 
points must really lie. The orange wedge represents the area where 
one would expect to find only a single Ha emission line, assuming 
an Ha detection limit of 3xl0 — 16 ergs s — 1 cm -2 (planned Euclid 
survey limit) and a fainter [OIII] detection limit of 1.7xl0 — 16 ergs 

— 1—2 

s 1 cm . 

that for which both Ha and [Om] can be detected. Up- 
per limits are 2-ct, assuming the FWHM of the line for 
which a limit is being derived is the same as for the mea- 
sured emission line. We find only two cases where [Om] 
is detected without Ha. 

The upper left part of Figure [Til where the Ha/[Oin] 
ratio is large and Ha fluxes are small, is empty of sources 
because that is where the flux limits of the survey pre- 
vent sampling. If the high Ha/[Oin] ratios we see at 
brighter fluxes continue down to fainter fluxes, then pre- 



12 



Colbert et al. 



sumably we would find many of the lower limits moving 
into that region if we were sensitive enough to detect 
such faint [Oiii] fluxes. The lower right portion of Figure 
fTTl where Ha/fOlll] ratio is small and the Ha fluxes are 
bright, is similarly devoid of objects, but that region is 
not excluded by our sensitivity limits. There is a real 
scarcity of low Ha/[Oni] sources at the bright flux lim- 
its. Excluding three lower limits, we find none of the 58 
galaxies with Ha fluxes greater than 3xl0 -16 ergs s _1 
cm -2 have ratios of Ha/[0m] < 0.95. 

While the WISP survey does not cover the same wave- 
length range as the future space near-infrared grism mis- 
sions (WISP goes to shorter wavelengths while the future 
missions will go longer), it does give us a rough estimate 
of the percentage of single line emitters those missions 
are likely to find. The exact percentage of single line 
emitters will depend on their final flux limits and crite- 
ria for establishing the reality of an emission line. For 
this example we again choose the presently planned de- 
tection limit for Euclid (3.5<r limit of 3xl0 -16 ergs s _1 
cm~ 2 ) and assume a secondary line only needs to be de- 
tected at a 2<T significance (1.7 xlO -16 ergs s _1 cm -2 ). 
The shaded orange region on Figure [IT] shows the region 
for which only single line emitters would be expected for 
these sets of limits. We find that at least 60% (35/58) of 
sources with emission lines brighter than 3xl0 -16 ergs 
s _1 cm~ 2 will be single line emitters in future surveys. 
This single line discovery rate could potentially be as 
high as 80% if all the lower limits presented actually lie 
within the single line discovery region. 

We can also look at a sample where the redshift is 
z>1.2, as that would better reflect the range over which 
[Oiii] would be detectable if the lower wavelength cut-off 
is the 1.1 jum cut-off of Euclid rather than the 0.85 /im 
of the WISP survey. The number of bright emitters is 
much smaller (12) but the percentage of expected single 
line emitters, 67-83%, is virtually the same as when we 
included the lower redshifts emitters. 

Our analysis assumes all the single emission lines are 
Ha, which we know to be incorrect roughly 6% of the 
time. Therefore it is possible that approximately 2 of 
the 31 single line emitters in that flux range could be 
low Ha/ [Oiii] sources, although if their true redshifts are 
greater than z — 1.5 then their Ha fluxes and Ha/[Oin] 
ratios are not constrained. It is important to emphasize 
that of the 25 sources with bright fluxes where we detect 
both Ha and [Om], none of them have Ha/[Om] < 1. 
There is no obvious reason why we would be missing all 
the galaxies with low Ha/[Oni] from 0.7< z <1.5, unless 
the ratio is so low that we lose the ability to detect Ha. 
However that would require an unlikely large gap in the 
Ha/ [Oiii] ratio, making all galaxies either strong Ha or 
strong [Oin]-emitters, with none in between. 

Regardless of the exact quantity of contamination from 
single line [Oiii] emission, clearly Ha/[Oni] < 1 sources 
are much rarer at the bright fluxes that future near- 
infrared grism missions will probe than in our fainter 
sample. Therefore the assumption that all single line 
emitters are Ha should produce very few contaminat- 
ing interlopers for these space missions, although further 
data is required to better establish exactly what the [Om] 
contamination rate is. 

The trend of increasing Ha/[Oin] ratio with Ha flux is 



a result of the known correlation between the Ha/[Oin] 
ratio and Ha luminosity previously reported by WISP 
(Dommguez et al. 2013) Their analysis also indicates 
that dust extinction increases with Ha luminosity, mean- 
ing that dust extinction is suppressing [Oiii] emission 
more at the bright luminosity end. However, the trend 
of stronger relative [Om] with fainter Ha luminosity 
is far stronger than their measured effects of extinc- 
tion. The dominant determinant of the Ha/[Oin] ratio is 
most likely the metallicity, as the [Oiii] 5007 A/H/3 ratio 
strongly correlates with metallicity in the local universe 
(i.e., Liang et al. 2006). Metallicity is known to correlate 
with mass and luminosity (Tremonti et al. 2004, Kob- 
ulnicky & Kewley 2004), so it is not surprising that the 
brightest Ha emitters, which tend to be the most lumi- 
nous in the continuum as well, would also have higher 
Ha/ [Oiii] ratios. 

4.3. Luminosity Functions 

Using the total volume densities derived for each lu- 
minosity bin (see Section 4.1) we are able to plot Ha 
luminosity functions in Figure [12] We have split the 
Ha emission line luminosity function into two redshift 
ranges, 0.3<z<0.9 and 0.9<z<1.5. We chose to split the 
sample at z=0.9 as that divides the detected Ha emis- 
sion lines by the grism with which they were detected 
(the dividing line between G102 and G141 is ~1.2/rai), 
which also has the added benefit of producing nearly even 
sample sizes (451 0.3<z<0.9 versus 534 0.9<z<1.5). We 
make no correction for dust extinction, but there is a 
correction factor of 0.71 applied to the luminosities to 
account for Nil contamination. Our best-fit Schechter 
parameters are presented in Table 2. 

Generally we find that the 0.9<z<1.5 luminosity func- 
tion is slightly higher at the faint luminosity end than the 
previous NICMOS grism studies, such as those by Shim 
et al. (2009) and Yan et al. (1999). This slight surplus 
in number density shrinks as the Ha luminosities be- 
come brighter, before dropping below the previous NIC- 
MOS surveys by logL=42.5. The faint end differences are 
likely explained by the greater sensitivity of the WISP 
survey to lower line equivalent widths (EWs). We mea- 
sure lines down to an observed EW of 40 A, while the 
typical NICMOS grism survey did not find lines much 
below 80-100 A (Shim et al. 2009) . We find that 15% 
of our total unweighted sample has EWs less 80 A. The 
redshift sampling difference (0.7<z<1.4 with NICMOS 
versus 0.9<z<1.5 with WFC3) is not sufficiently differ- 
ent to effect the luminosity function comparison. 

We also find our 0.9<z<1.5 number densities to be con- 
sistently higher (1.3-2 x) than the 0=1.47 number densi- 
ties derived from the narrow band Ha surveys (Sobral et 
al. 2012). The redshifts sampled by Sobral et al. (2012) 
- at z=0.84 and 1.47 - straddle the sampled range of 
our high redshift sample, so if all the luminosity func- 
tions were consistent, then the WISP luminosity func- 
tion ought to lie between the two Sobral et al. (2012) 
distributions, not above them both. The HiZELS data 
sample down to very similar EWs (25 A), so that is not 
a likely source of the difference. Neither data sample has 
made any attempt to remove AGN, so that can not be a 
source of difference either. Our lower redshift 0.3<z<0.9 
sample, on the other hand, almost perfectly overlaps the 
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Fig. 12. — He* luminosity function derived from WISP emission 
line sample. The red squares are line emitters from 0.3<2<0.9, 
while the blue circles are line emitters from 0.9<z<1.5. Best-fit 
Schechter functions have been overlaid on top with solid blue and 
red lines. The empty circles and squares are raw counts, while the 
solid circles and squares have been corrected for completeness. The 
diamonds and dashed line are the measured points and Schechter fit 
from the NICMOS grism study of Shim et al. (2009) for 0.7<2<1.4. 
The triangles and stars are from the HiZELS narrow band study 
(Sobral et al. 2012) at 2=0.84 and 2=1.47 respectively. The dotted 
line is the 2=0.07 Ha luminosity function (Ly et al. 2007). 

Sobral et al. (2012) narrow band z—0.84 number counts. 
However, with a median z—0.6 for our lower redshift Ha- 
emitters and the observed trend of rising luminosity func- 
tions across these redshift ranges, one would expect to 
find our Ha counts slightly below those of the z=0.84 
narrow band survey. That the number densities instead 
overlap suggests that the WISP grism survey is finding 
slightly more line emitters than the narrow band, al- 
though the absolute differences are small. The expected 
cosmological variance for the Sobral et al. (2012) narrow 
band counts is on the order of 20% (Trenti & Stiavelli 
2008), so that can not entirely explain the difference. 

Regardless of how our sample selection may differ from 
other surveys, we can robustly compare how the luminos- 
ity function evolves within the WISP survey as we the 
same methodology for selection and completeness cor- 
rections. It appears that most of the evolution from 
z =0.3-1.5 takes place at the bright end of the luminos- 
ity function. Within the errors, the luminosity functions 
are converging below logL=41. However, with only two 
low luminosity bins for the high redshift sample, this low 
luminosity covergence is not definitive. Further discus- 
sion of the Ha luminosity functions, their implied star 
formation rates, and the evolution of the star formation 
density, will be presented in a separate paper (Bunker et 
al. 2013, in prep). 

We also present the [Oin] luminosity functions in Fig- 
ure [T3] along with their accompanying best- fit Schechter 
parameters in Table 1. For [Oni] we split luminos- 
ity functions into two redshift ranges, 0.7<z<1.5 and 
1.5<z<2.3, which are the highest redshifts for which the 
[Oiii] luminosity function has been determined to date. 
The lower redshift bins cover the range over which we 
expect to also find the Ha line within the grism spec- 
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Fig. 13. — [Oiii] luminosity function derived from WISP emission 
line sample. The red squares are [Oiii] emitters from 0.7<2<1.5, 
while the blue circles are 1.5<2<2.3. We plot Schecter function 
fits with both a fixed a = -1.5 (solid lines) and one where we allow 
a to vary (dashed lines). The empty circles and squares are raw 
counts, while the solid circles and squares have been corrected for 
completeness. The triangles and solid line fit are from the 2=0.1- 
0.9 PEARS Grism Survey (Pirzkal et al. 2012), while the diamonds 
and dotted line fit are from the 2=0.84 narrow band study of Ly 
et al. (2007). 

tra. For z>1.5, Ha is redshifted beyond 1.65/im, leaving 
[Oiii] as the most powerful detectable line in the grism 
spectrum. The total sample sizes are 192 [OIII]-emitters 
at 0.7<z<1.5 versus 58 at 1.5<z <2.3. 

As remarked in Section 13.1. 1[ the [Oiii] number den- 
sities require higher completeness corrections than for 
Ha. Even after application of these additional larger 
corrections, we still find that the lowest luminosity bin 
for each redshift range appears low, indicating that there 
is some incompleteness for which we are not accounting. 
One possibility is that the number of mis-identified sin- 
gle line [Oiii]-emitters is even larger at the faintest ends 
than our simulations indicate. For instance, our simu- 
lations assume Ha/[0ni] ratios similar to the emission 
line sources that we find and can measure. Therefore, 
if a population of faint, low Ha/[0m] sources exist, we 
would be underestimateing the number of single line mis- 
identifications at the lower redshifts (at higher redshifts 
the Ha/ [Oiii] ratio does not matter as we can not see 
Ha at all). Regardless of the reason for the low faintest 
bin, we exclude them from the Schechter fits and further 
analysis. 

Comparing our [Oiii] luminosity functions to those de- 
rived at lower redshift (Pirzkal et al. 2012, Ly et al. 
2007), we see a strong increase in L* with redshift along 
with decrease in However, our measured luminosity 
bins do not constrain a well. If we require the 0.7<z<1.5 
a to be the same as seen at low redshift (a=-1.5), we find 
little significant change in with all the evolution in 
[OIII] from z=0.7-2.3 taking place in L*. 

5. SUMMARY AND CONCLUSIONS 

We present near-infrared emission line number den- 
sities, line ratios, and luminosity functions, based on 
29 fields from the WISP survey taken with both the 
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TABLE 2 

Luminosity Function Best-Fit Schechter Parameters 



Emission Line Redshift Range 



Ha 0.3<z<0.9 -2.27 ± 0.11 41.72 

Ha 0.9<z<1.5 -2.40 ± 0.12 42.13 

OIII] 0.7<z<1.5 -3.00 ± 0.21 42.31 

OIII] 1.5<z<2.3 -3.45 ± 0.40 42.88 

OIII] 0.7<z<1.5 -3.15 ± 0.08 42.40 

OIII 1.5<z<2.3 -3.37 ± 0.14 42.84 



G102+G141 grism filters. With this dataset we find 
emission line galaxies without any preselection bias, over 
a large continuous epoch of cosmic time. At least two fu- 
ture space missions, Euclid and WFIRST, will also con- 
tain very large near-infrared grism surveys. The WISP 
survey is a perfect laboratory to predict what these fu- 
ture missions can expect to find. 

We found the most significant potential issues with 
identifying emission lines from near-infrared grism spec- 
troscopy to be: 

• Confusion of Ha and [Om] due to insufficient 
wavelength resolution, which can lead to catas- 
trophic redshift identification failures. When the 
signal-to-noise gets low enough, one can not neces- 
sarily depend on being able to distinguish the [Om] 
doublet from a single emission line. This is par- 
ticularly true in the case where the emission line 
region is spatially extended, which effectively de- 
grades the wavelength resolution. This could po- 
tentially be a significant issue for shallower surveys, 
as the brighter emission line sources tend to also 
be larger in spatial extent. However, we found this 
line confusion to be almost entirely confined to the 
lower resolution (R<~130) G141 grism and not an 
issue for the higher resolution (R~210) G102 grism. 
Additionally our simulations found that the higher 
grism resolution is also needed to achieve the 0.1% 
accuracy in l+z required for baryonic acoustic os- 
cillation experiments. We would therefore strongly 
recommend that future near-IR missions keep to 
wavelength resolutions above R>200. 

• Contamination from overlapping spectra 
from bright sources, which effects even bright, 
high signal-to-noise spectra. It alters the mea- 
sured continuum, impacts the effectiveness of auto- 
matic line finding algorithms, makes it difficult to 
identify the correct host galaxy and emission line 
wavelength, and results in emission lines that are 
lost altogether. Roughly 5% of all our bright lines 
(>3xlCP 16 ergs s _1 cm~ 2 ) are lost to overlapping 
bright source contamination. The use of multiple 
roll angles would greatly minimize these effects, 
particularly for the brighter, less densely-packed 
sources mostly being targeted by future missions. 

• Lack of dithering, which not only greatly im- 
pacts our ability to mitigate cosmic rays, hot and 
warm pixels, and other artifacts that can mimic 
emission lines, but also prevents us from recover- 
ing the additional resolution that sub-pixel dither- 
ing can provide. With grism spectroscopy, this re- 
sults in not only improved spatial resolution, but 



L* a 

i ± 0.09 -1.27 ±0.10 

! ± 0.09 -1.34 ± 0.16 

. ± 0.17 -1.32 ± 0.36 

I ± 0.35 -1.59 ± 0.77 

) ± 0.08 -1.5 (fixed) 

I ± 0.11 -1.5 (fixed) 



also improved wavelength resolution, which helps 
to address the insufficient wavelength resolution is- 
sue already discussed above. 

While WISP is sensitive to Ha emitters down to z=0.3, 
both Euclid and WFIRST will likely have low wavelength 
cut-offs around 1.1 /urn making them mostly sensitive 
to emission line galaxies at z>0.7. We find that our 
cumulative number of Ha galaxies at 0.7<z<1.5 reaches 
10,000 deg~ 2 by an Ha flux of 2x 10~ 16 ergs s" 1 cm~ 2 , in 
agreement with the corrected count predictions of Geach 
et al. (2010). Ha-emitting galaxies with comparable 
[Om] flux are roughly five times less common at these 
(<~l-3 xl0~ 16 ergs s _1 cm~ 2 ) emission line fluxes. 

As emission line fluxes become fainter the Ha/[Om] ra- 
tio becomes smaller, largely a result of the correlation be- 
tween the Ha/[Om] ratio and Ha luminosity previously 
reported by WISP (Dommguez et al. 2013). While great 
numbers of Ha/[Oin]<l emission line galaxies can be 
found around 5x 10 -17 ergs -1 cm~ 2 , there are effectively 
none in our sample once the Ha flux becomes greater 
than 2xl0~ 16 ergs -1 cm~ 2 . These large Ha/[Om] ra- 
tios for the brighter emission lines suggest that 60-80% 
of the Ha emission lines found by future space missions 
will be single lines. However, that same high Ha/[Om] 
ratio suggests that the likelihood of contamination by 
lines besides Ha remains low. 

Our Ha luminosity function found a slightly higher 
number density of faint line emitters than the NICMOS 
near-infrared grism surveys, likely a result of the smaller 
equivalent widths to which the WISP survey is sensi- 
tive. We also find our 0.9<z<1.5 number counts to 
be a bit higher (1.3-2x) than the 0=1.47 narrow band 
Ha survey of HiZELS (Sobral et al. 2012), which can 
not be explained by their minor difference in equiva- 
lent width cut-offs. On the other hand, our lower red- 
shift 0.3<z<0.9 Ha-emitter number counts are in solid 
agreement with the narrow band counts from HiZELS 
at z=0.84. However, the lower median redshift of the 
0.3<z<0.9 WISP sample suggests that we should find 
slightly fewer galaxies than HiZELS for the lower red- 
shift sample as well. Taken together, these luminosity 
function differences suggest that narrow band surveys 
may be missing some galaxies, although the differences 
are small enough that the count differences could also be 
consistent with a 2-3 a statistical outlier. 

The [Om] emission line probes a higher redshift range 
than Ha and is therefore not sensitive to the fainter lu- 
minosities required to properly constrain a. If we fix a=- 
1.5, as seen in studies at lower redshifts, we find that the 
evolution in the [Om] luminosity function from z=0.7- 
2.3 is almost entirely in the L* parameter, which steadily 
increases with redshift over the sampled range. 
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